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^ . Abstract 

■ The lower bound of Cramer and Rao is generalized to pairs of 
. families of probability distributions, one of which is escort to the other. 
I This bound is optimal for certain families, called ^-exponential in the 

■I— j- ' paper. Their dual structure is explored. 

o : 

"Sh' 1 Introduction 

1 ■ 

^ \ Aim of this paper is to translate some new results of statistical physics into 

^ ' the language of statistics. It is well-known that the exponential family of 

• probability distribution functions (pdfs) plays a central role in statistical 

•'-j . physics. When Gibbs introduced the canonical ensemble in 1901 [1^ he pos- 

! tulated a distribution of energies E of the form 
S ■ 

p{E) = exp(G - (3E) (1) 

where G is a normalization constant and where the control parameter (3 is the 
inverse temperature. Only recently |2], a proposal was made to replace (HJ 
by a more general family of pdfs. The resulting domain of research is known 
under the name of Tsallis' thermostatistics. Some of the pdfs of Tsallis' 
thermostatistics are known in statistics under the name of Amari's a-family 
jS] . The latter have been introduced in the context of geometry of statistical 
manifolds j3] . The appearance of the same family of pdfs in both domains is 
not accidental. The apparent link between both domains is clarified in the 
present paper. 
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The new notion introduced in Tsallis' thermostatistics is that of pairs of 
famihes of pdfs, one of which is escort of the other Some basic concepts 
of statistics can be generahzed by replacing at well-chosen places the pdf 
by its escort. In particular, we show in the next section how to generalize 
Fisher's information and, correspondingly, how to generalize the well-known 
lower bound of Cramer and Rao. Section 3 studies the statistical manifold of 
a family for which there exists an escort family satisfying the condition under 
which the generalized Cramer-Rao bound is optimal. This optimizing family 
has an afiine geometry. Since this is usually the characteristic property of an 
exponential family a generalization of the latter seems indicated. 

Section 4 shows how a strictly positive non-decreasing function of ]R_|_ 
determines a function which shares some properties with the natural loga- 
rithm and therefore is called below a ^-logarithm. The inverse function is 
called the 0-exponential. In Section 5 it is used to define the 0-exponential 
family in the obvious way, by replacing the exponential function exp by the 
(/)-exponential function. The standard exponential family is then recovered 
by the choice (j){x) = x, the a-family of Amari by (j){x) = x^^~^°'^^'^, the equi- 
librium pdfs of Tsallis' thermostatistics by the choice 4>{x) = x'^. 

The next three sections are used to establish the dual parametrization 
of the 0-exponential family and to discover the role of entropy functionals. 
Section 6 introduces a divergence of the Bregman type. In Section 7 it is used 
to prove the existence of an information function (or entropy functional) 
which is maximized by the 0-exponential pdfs. Section 8 introduces dual 
parameters — in statistical physics these are energy and temperature. The 
paper ends with a short discussion in Section 9. 

There have been already some attempts to study Tsallis' thermostatis- 
tics from a geometrical point of view. Trasarti-Battistoni |.6] conjectured a 
deep connection between non-extensivity and geometry. He also gives gen- 
eral references to the use of geometric ideas in statistical physics. Several 
authors [3 El E] have introduced a divergence belonging to Csiszar's class 
of f-divergences, which leads to a generalization of the Fisher information 
metric adapted to the context of Tsallis' thermostatistics. The relation with 
the present work is unclear since here the geometry is determined by a di- 
vergence of the Bregman type. Also the recent work of Abe ^U] seems to be 
unrelated. 
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2 Estimators and escort pdfs 



Fix a measure space Q, fi. Let Aii{fi) denote the convex set of all probability 
distribution functions (pdfs) p normalized w.r.t. /i 



di2{x) p{x) = 1. (2) 
Expectations w.r.t. p are denoted Ep 

Ep/= [ dfi{x)p{x)f{x). (3) 



Fix an open domain D of M". Consider a family of pdfs pg, parametrized 
with 9 in D. The notation Eg will be used instead of Ep^. Simultaneously, 
a second family of pdfs {Pg)g<zD is considered. It is called the escort family. 
The notation ¥g will be used instead of Ep^. 

Recall that the Fisher information is given by 

UO) = E,(|^log(p,)) 

' pe{x) 89^ 09' ■ ^ ' 

A generalization, involving the two families of pdfs, is 

gm = y/Ma^)^^^. (5) 

Clearly, the expression coincides with Q if Pe = Pe- 

The following definition is a slight generalization of the usual definition 
of an unbiased estimator. 

Definition 1 An estimator of the family {pe)e&D is a vector of random vari- 
ables Ck with the property that there exists a function F such that 

Eeck = -^F{9), k = l,---,n. (6) 

The function F will be called the scale function of the estimator. 

The estimator is unbiased if F{9) = ^9^9^ so that EgCfc = 9^. The well-known 
lower bound of Cramer and Rao can be written as 

m^m' [EgCfcQ - (EeCfc)(EgQ)] ^ 1 . . 

["^"' j&r " "'"'^-(»'' 

for arbitrary u and f in M". 

A similar lower bound, involving the information matrix qm instead of 
Fisher's Iki, is now formulated. 
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Theorem 1 Let be given two families of pdfs {pejeeo one? {P0)eeD cor- 
responding expectations Kg and ¥g. Let c be an estimator of {pe)eeD, with 
scale function F. Assume the regularity condition 

holds. Let gki{6) be the information matrix introduced before. Then, for all 
u and V in M" is 

u^u'{¥eckci-{¥eck)i¥eci)] ^ 1 



The bound is optimal (in the sense that equality holds whenever u = v) if 
there exist a normalization function Z > and a function G such that 

^ :Pe{x) = Z{d)Pe{x)i-[G{e) - 0'ci{x)] (10) 



holds for all k in [l..m], for all 6 E D, and for fi-almost all x. In that case, 
c is an estimator of {P0)0qd with scale function G 

^ dG 

^^^^ = ofk- (11) 

Proof Let 

1 d 

= -fT^Ps Yk = Ck- ¥eCk. (12) 

From Schwartz's inequahty foUows 

{¥eu''Ykv'Xif < {¥eu'YkU%) (F,t;^X,t;'X;) . (13) 
The l.h.s. equals, using (jH)), 
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The first factor of the r.h.s. equals 

¥eu'Yku'Yi = MV[F,CfcQ-(F,Cfc)(F,Q)]. (15) 
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The second factor of the r.h.s. equals 

¥ev''Xuv'Xi = v'^v'gkiie). (16) 

This proves 

Assume now that ()1U|) holds. Combining it with the regularity condition 
(jSJ shows that c is an estimator for the escort family, with scaling function 
G. This makes it possible to write (fTUI) as 

1 d 

:Pe{x) = ¥0Ck-Ck{x). (17) 



Z{9)Pe{x) 39^ 
In this way one obtains 



u^'u' [WeCkCi-iWeCkWeCi)] 



z{ey ' 



On the other hand is 

no) = -T^^eck 



= Z{e) j dii{x)Pe{x)cu{x)^[G{e)-e'ci{x)] 

= -Z{ef[¥ecuci-i¥eck)i¥eci)]. (19) 

Together with (fTH|) this shows equality in whenever u = v. 

□ 

It is not investigated whether (fTUj) is a necessary condition. For practical 
application of the lower bound one has to assume that c is also an estimator 
of the escort family {P0)g^D, with scale function G. The previous proposition 
shows that this is automatically the case when (fTnjl is satisfied. 

Example 1 Let fi be the Lebesgue measure restricted to [0, +oo) and let 

, . 2 



(20) 



with ^ > and = max{M,0}. The Fisher information liff) is divergent. 
Hence, the usual lower bound of Cramer and Rao is useless. 
Consider now the escort family 

Pe{x) = (21) 
5 



Then one calculates 



g{e) = ^(5e-13). (22) 



This fixes the r.h.s. of the inequality 

Let us estimate 6 via its first moment, with c{x) = 3x. One has Kgc = 6, 
Eec^ = {3/2)e\ F{e) = 6^/2, Fc = 36 and Fc^ = IS^^^ jhen © boils down 
to 

Fc^ - (¥c? = > — ^ -e^ ~ 0.4 e\ (23) 

^ ^ -4(5e-13) ^ ^ 

3 Statistical manifold 

The well-known example of a family with optimal estimator is the exponential 
family 

pe{x)=exp{G{9)-9'ck{x)) (24) 

with 

G{9) = - log / d/x(x)e-^'"'=(^). (25) 
Jn 

One sees immediately that 

^ :Pe(x) = peix) (4^Gi9) - Ck{x)) , (26) 



which is (|T7jl with 2'(^^) identically 1 and the escort pdf Pg equal to pg. This 
example motivates also the geometric interpretation of (fTUI) . in the form (fT7j) . 
as a linear map between tangent planes. The score variables dlogps/ 89^ of 
the standard statistical manifold are replaced by the variables 

^ ^:Pe{x). (27) 



Poix) 09'' 



They are tangent vectors of the concave function G{9) — 9^ci. The metric 
tensor of the latter function is a constant random variable. The geometry 
of the manifold of random variables {G{9) — 9^ci)q^j^ is transferred onto the 
family of pdfs {pe)e<^D- 

Note that the score variables have vanishing expectation ¥g. It is now 
obvious to define an inner product of random variables by 

{A,B)e = WeAB. (28) 
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Then one has 



1 dpe 1 dpe 



9ki{e). 



(29) 



Let g^''{9) denote the inverse of gki{0) (assume it exists). Then a projection 
operator ttq onto the orthogonal complement of the tangent plane is defined 

by 

1 dpe \ 1 dpg 



Tie A = A- g 

If (fTUj) is satisfied, then 
d 1 dpe 



kl 



Pede^ 



A 



Pe de^ 



de^ Pf) de^ 



dZ_ 

W 
dz_ 

W 

0. 



Ck) + z{e) 



¥aA. 



WW 



(30) 



^0Ck -Ck + g 
^eCk - Ck 



1 dpg 



1 dpe 
PbOO''' 



This follows also immediately from 

d 1 dpe 1 dZ 1 dpe 



(31) 



(32) 



That the derivatives of the score variables are linear combinations of the 
score variables and the constant random variable is usually the character- 
istic feature of the exponential family. This is a motivation to introduce a 
generalized notion of exponential family. 



4 0- logarithms and ^-exponentials 

In the next section the notion of exponential family is generalized to a rather 
large class of families of pdfs. This is done by replacing the exponential 
function by some other function satisfying a minimal number of require- 
ments. The latter function will be called a deformed exponential and will 
be denoted exp^. This has the advantage that the resulting expressions look 
very familiar, resembling those of the exponential family. 

Fix an increasing function (p of [0, +oo), strictly positive on (0, +oo). It 
is used to define the 0-logarithm In,^ by 

r 1 

lia^{u) = / dv——, u>0. (33) 
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Clearly, In^ is a concave function which is negative on (0, 1) and positive 
on (l,+oo). The inverse of the function In^ is denoted exp^. It is defined 
on the range of In^. The definition can be extended to all of M by putting 
ex-p^{u) = if M is too small and exp^ = +oo if u is too large. In case 
4>{u) = u for all u then ln<^ coincides with the natural logarithm and exp^ 
coincides with the exponential function. 
Given 0, introduce a function of M by 

ip{u) = 0(exp^(-u)) if u is in the range of ln<^ 
= if M is too small 

= +00 if u is too large. (34) 
Clearly is = ip{\n^{u)) for all m > 0. 
Proposition 1 One has for all u inK. 

< exp^{u) = 1+ / dvijj{v) 
. Jo 

dvip^v) < +00. (35) 

Proof First consider the case that [0, u) belongs to the range of In^. Then a 
substitution of integration variables v = lia^{w) is possible. One finds, using 
dv/dw = l/(j){w) and ip{v) = 0(exp^(t>)) = 0(w), 

dv^/j{v) = / dw 

Jl 

= exp^(M) - 1. (36) 

Using exp^(— cxd) = one concludes 

In case M = sup^ln(^(f) is finite and u > M then tpiy) = +oo for 
V G [M, u] . One has 

dv^{v) > I dvip{v) 

+ 00 

dw 







= +00. (37) 

But also the l.h.s. of ()35|1 is infinite. Hence the equality holds. 

Finally, if m = inf^,ln<^(f) is finite and u < m then il){v) = holds for 
V < m. Hence 

u pm 

dv^p{v) = / df?/'(f) 
Jo 
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This ends the proof. 



(38) 
□ 



Proposition 2 The function exp^ is continuous on the open interval of 
points where it does not diverge. 

Proof Let m and M be as in the proof of the previous proposition. Then 
exp0 is different iable on (m, M). If m = — oo this ends the proof. If m is 
finite then it suffices to verify that exp^(M) is continuous in m = m. But this 
is straightforward. 

□ 



Example 2 Let (j){u) = u'^ with g > 0. This function is increasing and 
strictly positive on (0, +cxd). Hence, it defines a ^-logarithm which will be 
denoted In^ and is given by 



r 1 

' Jl VI 

- 1 



ifg^l 



1-q 

= log{u) i{q=l. (39) 

This deformed logarithm has been introduced in the context of nonextensive 
statistical physics in ^Jj. The inverse function is denoted exp^ and is given 

by 

exp^(M) = [l + (l-q)u]f'-'\ (40) 
The function ip is then given by 

^(^u) = [l + il-q)u]f'-''\ (41) 

Example 3 Let 0(x) = [x] , the smallest integer not smaller than x. This 
piecewise constant function is increasing and strictly positive on (0, +oo). 
Hence, In,^ is piecewise linear. The function ip is given by 

iIj{x) = if X < — 1 

= 0(1 + x) otherwise. (42) 

The 0-exponential exp^ is also piecewise linear and satisfies 

exp^(x) = x<-l. (43) 
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5 The (/)-exponential family 



Let (j) be given as in the previous section. Fix a measure space ft, /i and a 
set of random variables Ck,k = 1, - ■ ■ ,n. The 0-exponential family of pdfs 
is defined by 

Pe{x)=exp^{G{9)-9''ck{x)). (44) 



The domain D is an open set of 6 for which G{6) exists such that 
is properly normahzed, i.e. pg G A1i(/i). The distributions are the 
equilibrium pdfs of generahzed thermostatistics as introduced in [T2| IT3]. 

Proposition 3 The function G{9) is concave on D. 

Proof Assume 9, rj and X9 + {1 — A)?7 in D for some A in [0, 1]. Then, using 
convexity of exp^, 

exp^ (AG(0) + (1 - \)G{r]) - [A^^'^ + (1 - \)v'']ck{x)) 

< Xpe{x) + (1 - \)p„{x). (45) 

Hence 

f d/i(x) exp^ {XGi9) + (1 - X)Gi7]) - [X9'' + (1 - X)r]k]ckix)) < 1. (46) 

Since exp^ is increasing one concludes that 

XGi9) + (1 - A)G(r/) < GiX9 + (1 - X)r]). (47) 
Hence G is concave. 

□ 

Proposition 4 Let ip be determined by via |^j). // the integral 

Z{9)= ! d^i{x)i){G{9)-9^Ck{x)) (48) 



converges for all 9 E D, then (pe)eg£) h^s an escort family {Pg)^^^, given by 

= otherwise. (49) 
Condition n7]\) is satisfied. 
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Proof One has 

<P{pe{x)) = 0(exp^(G(e)-e'=Cfc(a;))) 

= ^lj{G{e)-e''ck{x)). (50) 

Because (j){pg{x)) cannot be zero for yu-almost all x one concludes that Z{6) > 
and that Pg is properly normalized. 

From the properties of the function exp^ follows immediately that 

^Mx) = ^{G{e)-e>^c,{x))^{G{e)-e"^Ux)) 

= Z{9)Pe{x)^{G{9)-9"'cUx)). (51) 
This proves that {Pd)Q(^jj satisfies (fTUj) . 

□ 



Example 2 continued Let (f){u) = u'^ as in Example 2 above. The pdfs 
pe are given by 

peix) = [l + (l-q)(G{e)-9''c,ix))Yl^'-'\ (52) 
for 6* in a suitable domain D. The escort probabilities are 

Po{x) = ^^[l + {l-q){G{e)-e'c,{x))]f'''^ (53) 

with 

Zi9) = [ dix{x)[l + {l-q){G{9)-9''c,{x))]f'-'^ (54) 
Jn 

(assuming convergence of these integrals). The family {pe)Q^D coincides with 
Amari's a-family jSj, with a given by a = 2g — 1. 

Example 1 continued Example 1 is the q = 0-limit of example 2. Let 
(f){u) = 1 for all u > 0. Then 

lrL(f,{u) = u — 1 

exp^(M) = [1 + u]+ 

'ip{u) = 1 if M > -1; 

= otherwise. (55) 



One has 

Peix) 



1-? 
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/2 2x\ , , 

= exp^ (^^ - 1 - -^J • (56) 

This is a 0-exponential family with parameter Q = \j(P' ^ estimator c{x) = 2x 
and scale function G{Q) = The escort probabilities, making inequality 

^ optimally satisfied, are given by 

Peix) = ^lo<.<e. (57) 

The information matrix g{Q) equals 6'^/3. Further is ¥qc = 9 and Fec^ = 
4^2/3 and 

^F(e) =E©c=2^/3 = 2/3v^. (58) 

It is now straightforward to verify that the inequality is optimally satis- 
fied. 



6 Divergences 



Divergences of the Bregman type are needed for what follows. In the form 
given below they have been introduced in p!^. 

Fix a strictly positive increasing function of [0, +oo). Introduce 



DM\P) 



d/i(x) 



p(x) 



p'(x) 



du [ln<^(M) - ln<^(p'(x))] . 



(59) 



D(i){j)\\p') > follows because In^ is an increasing function. Also convexity 
in the first argument follows because In,^ is an increasing function. 

Let (p6»)eg_D 0-exponential. Then infinitesimal variation of the diver- 
gence D^{p\\p') reproduces the metric tensor gki{6), up to a scalar function. 
Indeed, one has 



d 



■D^{pe\\p^)\r,= 







d 

Q^D^{pe\\p^)\^^e = 



(60) 



and 



q2 

d 



d 

d/i(x) [In^ (peix)) - In^ {p^{x))] g^Pe{x] 



■q=e 



12 



1 



_d_ 



pe[x) 



d_ 



Pe[x) 



gum. 



m 

Similar calculations give 
92 



(61) 



dQ^drf 



jj=e drfdr] 



n=B z{e) 



9km . (62) 



7 Information content 

In ^3] the definition of deformed logarithm contains the additional condition 
that the integral 



u 



/•! 

du hi4,{u) = du 
1 Jo (PW 



< +00 



(63) 



converges. This condition is needed in the definition of entropy functional 
/ information content based on the deformed logarithm. Introduce another 
strictly increasing positive function x by 



u 



l/v 





(64) 



The motivation for introducing this function comes from the fact that it 
satisfies the following property. 



Lemma 1 



t;ln^(l/f) 



u 



^"0(«)' 



Proof 



^vhiJl/v) = hiJl/v) ,_, . , 

dv ' ^ x\ I J vx{l/v) 

du — / du 



1 X{.u) V Jo 0(m) 



l/v 



du 



du 



l/u 



Z 



'0 

1 



u 



'dz^-- r du^ 

(P{Z) V Jq (f){u) 



(65) 
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(66) 



which is the desired result. 



□ 



Define information content (also called entropy functional) /</,(p) of a pdf 
p in Mi{iJi) by 



hip) 



n 



diJ,{x)p{x) ln^(l/p(a;)) 



(67) 



whenever the integral converges. Using the lemma one verifies immediately 
that hip) is a concave function of p. A short calculation gives 

l/p{x) 



hip) = / dAi(x)p(x) / dw— - 



/ d//(a;)p(a;, , , ^- 

Jn Ji Xi^/v) V 

fp{x) r I'v 

(lli{x)p{x) / du 



u 



-I' 

Jn 



diJ,{x)p{x) 



V 

1 



x(i) 



p(x)x(l/p(x)) x(l) 

dfj,{x) / du ln^{u). 
n Jo 



\n4>{p{x)) 



(68) 



This implies that 



hip) ~ hip') ^ ~ d/x(x) / du In. 

Jn Jp'{x) 



iu), 



(69) 



and hence 



DM\p') = hip')-hip)- f Mx)ipix)-p'ix))'^^<t>ip'i^))- (70) 

Jn 

This relation links the divergence D^{p\\p') with the information function 
hip)- 

The following result shows that the 0-exponential family is a conditional 
maximizer of I^. It also shows that the scale function F is the Legendre 
transform of the information content 7,^ 
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Theorem 2 Let {pe)0(zj:, be (j)- exponential, with estimator c and scale func- 
tions F and G. Then there exists a constant Fq such that 

F{d) = Fo+ mill {E,e'ck-I^ip)}. (71) 

The minimum is attained for p = pe. In particular, F{6) is a convex function 
of 9 and Pe maximizes I^{p) under the constraint that 

E/'^Cfc = E.^'^Cfc. (72) 

Proof Let us first show that for any pdf p 

^pO^Ck - IM > Eee'^Ck - I^pe). (73) 

One has 

dn{x) {p{x) - peix)) In^ (peix)) 

df^ix) ipix)-peix)) [Gi9)-9''c,] 
= -iEp-Ee)9''ck. (74) 
Hence, (I7n|) becomes now 

D^{p\ \pe) = IM - hip) + (Ep - Ee)^^'cfc. (75) 

But one has always Drj,(j)\\pe) > 0. Therefore, (f7H|l foUows. 
Next calculate, using the lemma, 

d ^ , . , , , u \ d 



a. 



39^ 



I<p{pe) = J d/i(x) (^-\n^{pe{x)) - ^j^^ W^^^^'^ 

d/i(x) (^-G{9) + 9'ci{x) - dw^^ W^'^'^'l 
d 

d/i(x) {9^ci{x)) gg^Peix) 

-{Eg9'ci)-EeCk. (76) 



d 



09'' 

Because c is an estimator with scale function F one obtains 

^{Ee9^c,-IM) = -^Fi9). (77) 
Hence there exists a constant Fq for which 

F{9)=Fo + Ee9'ci-I^ipe). (78) 
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In combination with (|7S|) this results in (f7T|) . 

□ 



Without restriction one can assume Fq = 0. In statistical physics the 
function F{6) is the free energy as a function of temperature, up to some 
proportionality factor. 



Example 4 Let (f){u) = u'^~'^/q, with < g < 2. This is of course only a 
reparametrization of example 2, which is done to recover expressions found 
in the literature. The deformed logarithm is given by 

IMu) = ^(n^-i-l) ifg^l 

= log(M) if g = 1. (79) 

One obtains x(^) = and hence 

W = y"d^(a;)p(a;)i^^^. (80) 

This is the entropy functional proposed by Tsallis |2^ as a basis for nonexten- 
sive thermostatistics, and reported earlier in the literature by Havrda and 
Charvat JH] and by Daroczy J7]. The corresponding expression for the 
divergence is 

d4p,p') = -l^Jdfi{x)p{x)[p{xr-'-p'{xr~'] 

-J d/x(x) [p{x) -p\x)]p'{xy'\ (81) 



8 Dual coordinates 

Introduce dual coordinates 



dF 

rik = Egck = (82) 



Assume (fTUj) holds. Then, one obtains from (fT^ 
drjk d 

= -Zi9)[¥eckci-{¥eck){¥gci)] 
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m 



To obtain the last line a 0-exponential family has been assumed. This relation 
implies 



09'' 
drji 



-z{e)g'\e). 



Thes are the orthogonality relations between the two sets of coordinates 6 
and rj. Next we derive the dual relation of (jH2I)- 

Proposition 5 Let {pe)^^^ be (p- exponential. Assume the regularity condi- 
tion (0) is satisfied. Then 



d_ 

dm 



hive 



^5) 



Proof One calculates (assume integration and partial derivative can be in- 
terchanged), using Lemma ((H), 



d 



h{Pe) 



d/i(x) 
dyu(x) 



In,^ {pe{x)) + 



u 



G{9)-e'ci{x) + 
d 



<P{u) 
du - 



d_ 



pe[x) 



u 



d 



u)\ de>' 



Pe{x) 



d/i(x) e^ci{x)^pe{x). 



(86) 



To obtain the last line the regularity condition has been used. Use now that 
pe satisfies One obtains 



d_ 



I^ipe) = Z{9)¥e9'ci{¥eCk-Ck) 
= -Z{9)9'gik{9). 



^7) 



In combination with (|8Hl this gives 



d_ 
drji 



hijPe) 



d_ 



h{Pe 



89'' 
drji 



-Z{9)9"'grra{9)) 



Z{9) 



g'\9) 



9'. 



□ 
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Equation (jH^ is the dual relation of (jHH)- Expression (fTHj) can now be 
written as 

F{e) + E{7^) = e% (89) 

with E{r]) = I^{pe)- 

9 Discussion 

The present paper introduces generalized exponential families, and calls them 
0-exponential because they depend on the choice of a strictly positive non- 
decreasing function of (0, +oo). Several properties, known to hold for the 
exponential family, can be generalized. The paper starts with a generalization 
of the well-known lower bound of Cramer and Rao, involving the concept of 
escort probability distributions. It is shown that the 0-exponential family 
optimizes this generalized lower bound. The metric tensor, which generalizes 
the Fisher information, depends on both the family of pdfs and the escort 
family, and determines the geometry of the statistical manifold. 

In the final part of the paper deals with the dual structure of the sta- 
tistical manifold. In statistical physics this duality maps mean energy onto 
temperature and entropy onto free energy. The dual structure is shown to 
exist in the general context of 0-exponential families. 

Throughout the paper the number of parameters n has been assumed to 
be finite. A non-parametrized approach to statistical manifolds is found in 
jl8j . The extension of the present work to this more abstract context has 
not been considered. 
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